Systems and methods for remote detection of software through browser webinjects

ABSTRACT

Computer-implemented methods and systems are provided for the detection of software presence remotely through the web browser by detecting the presence of webinjects in a web browser that visits a detection webpage. The methods can include delivering a detection webpage to a web browser, in which the detection webpage has detection code configured to detect a presence of the webinject in the detection webpage; and inspecting, by the detection code, rendering of content of the detection webpage in the browser to detect webinject content in the detection webpage by the webinject, the webinject content including one or more Hypertext Markup Language (HTML) components. The method can further include, if webinject content is detected, generating a fingerprint for each of the one or more HTML components; transmitting the one or more fingerprints to an external server; and classifying, by the external server, the webinject based on the one or more fingerprints.

CROSS-REFERENCE TO RELATED APPLICATION

The present application is a continuation of U.S. patent applicationSer. No. 17/000,135, filed Aug. 21, 2020, which is a continuation ofU.S. patent application Ser. No. 16/688,647, filed Nov. 19, 2019, nowU.S. Pat. No. 10,776,483, issued Sep. 15, 2020 which is a continuationof U.S. patent application Ser. No. 16/170,680, filed Oct. 25, 2018, nowU.S. Pat. No. 10,521,583, issued Dec. 31, 2019 and titled “Systems andMethods for Remote Detection of Software Through Browser Webinjects,”the entirety of which are incorporated herein by reference.

TECHNICAL FIELD

The following disclosure is directed to methods and systems for thedetection of software remotely through a web browser and, morespecifically, methods and systems for detection of software remotelythrough a web browser by detecting the presence of webinjects in a webbrowser.

BACKGROUND

Modern software often uses webinjects to change with a user's webbrowsing experience. Examples of such software include malware, adware,browser extensions, and anti-virus programs. Webinjects are pieces offoreign code, e.g., Hypertext Markup Language (HTML) or JavaScriptelements, that can be locally injected into webpages visited by theuser. These webinjects can be injected through several techniques, forexample, through a browser extension application programming interface(API), browser process memory injection, or local network proxies. Thewebinjects can change the webpage to steal information (e.g., passwords,personal data, etc.), present additional content to the user (e.g.,advertising), and/or improve the user's browsing experience (e.g., byblocking advertising, presenting useful information, improvingfunctionality, etc.). Motives for inserting webinjects into webpages canrange from stealing information to displaying advertising, or evenimproving the user's experience.

SUMMARY

Disclosed herein are systems and methods to detect webinjects, and theirsources, in webpages. Some approaches for detecting software presenceremotely include (i) scanning the Internet for systems that publiclyexpose services and (ii) using a sinkhole to isolate a domain andreceive software connections as the software reaches the sinkhole,which, in some instances requires the subject domain to be expired orotherwise available. Another approach uses crawlers in a peer-to-peer(P2P) configuration, where the crawler joins the P2P network andreceives connections from other peers. However, this approach is limitedto P2P-enabled software. This method, if deployed through advertisingnetworks or other partners that can provide large amounts of traffic,can detect a considerable number of software installations by detectingthe presence of webinjects in the browser. The exemplary methods andsystems described herein can be used alone or complement any one or moreof the above methods to detect webinjects.

In a first aspect, a computer-implemented method is provided for thedetection of webinjects. The method includes delivering a detectionwebpage to a web browser. The detection webpage has detection codeconfigured to detect a presence of the webinject in the detectionwebpage. The method further includes inspecting, by the detection code,rendering of content of the detection webpage in the browser to detectwebinject content inserted into the detection webpage by the webinject.The webinject content includes one or more Hypertext Markup Language(HTML) components. The method further includes, if webinject content isdetected, generating, by the detection code, a fingerprint for each ofthe one or more HTML components; transmitting, by the detection code,the one or more fingerprints to an external server; and classifying, bythe external server, the webinject based on the one or morefingerprints.

Embodiments of the method can include any one or more of the belowfeatures. The method can include transmitting to the external sever, bythe detection code, one or more HTML components of the detectedwebinject content. The transmission can be on a portion of theexecutions of the detection code. The transmission can be on a smallsample of the executions. For example, the transmission can be on 1% orless of the executions of the detection code. These HTML component(s)can be transmitted with their respective fingerprints. The method caninclude identifying the origin software of the HTML component(s) by (i)searching for the HTML component(s) in sandboxed executions of softwareand/or (ii) searching through privately- and/or publicly-available datasources. The sandboxed executions of software may be associated with,may be related to, or possibly be the origin software. The method canfurther include generating a database including (a) the fingerprint(s),(b) the name of the origin software, (c) one or more features of thedetected webinject content, and/or (d) one or more capabilities (e.g.,intercepting communication or changing form contents) of the detectedwebinject content. This database can be used to classify webinject(s)detected on remote systems.

Delivering the detection webpage having detection code can furtherinclude configuring the detection code such that at least one of asource domain, a path, or an HTML structure of the detection webpage isconfigured to trigger an injection of the webinject content by thewebinject. The detection code can include JavaScript or Content SecurityPolicy (CSP). The detection webpage can be inserted into an HypertextMarkup Language (HTML) inline frame. The method can further includegenerating a classification of the one or more webinjects. Classifyingthe webinject based on the one or more fingerprints can further includedetermining an originating software of the webinject based on the one ormore fingerprints. Classifying the webinject based on the one or morefingerprints can further include mapping the one or more fingerprints toa feature set of the webinject. The detection webpage can be deliveredby a traffic generating entity. Delivering a detection webpage to a webbrowser can further include embedding, by the traffic generating entity,the detection webpage into an external webpage. Delivering a detectionwebpage to a web browser can occur upon receiving an indication of auser interaction with the content of a webpage, wherein the webpage isseparate from the detection webpage. The webinject content can includeadded or modified content by the webinject.

In a second aspect, a system is provided for detection of webinjects.The system includes one or more computer systems programmed to performoperations that include delivering a detection webpage to a web browser.The detection webpage has detection code configured to detect a presenceof the webinject in the detection webpage. The operations furtherinclude inspecting, by the detection code, rendering of content of thedetection webpage in the browser to detect webinject content insertedinto the detection webpage by the webinject. The webinject contentincludes one or more Hypertext Markup Language (HTML) components. Theoperations further include, if webinject content is detected,generating, by the detection code, a fingerprint for each of the one ormore HTML components; transmitting, by the detection code, the one ormore fingerprints to an external server; and classifying, by theexternal server, the webinject based on the one or more fingerprints.

Embodiments of the system can include any one or more of the belowfeatures. The operations can include transmitting to the external sever,by the detection code, one or more HTML components of the detectedwebinject content. The transmission can be on a portion of theexecutions of the detection code. The transmission can be on a smallsample of the executions. For example, the transmission can be on 1% orless of the executions of the detection code. These HTML component(s)can be transmitted with their respective fingerprints. The operationscan include identifying the origin software of the HTML component(s) by(i) searching for the HTML component(s) in sandboxed executions ofsoftware and/or (ii) searching through privately- and/orpublicly-available data sources. The sandboxed executions of softwaremay be associated with, may be related to, or possibly be the originsoftware. The operations can further include generating a database of(a) the fingerprint(s), (b) the name of the origin software, (c) one ormore features of the detected webinject content, and/or (d) one or morecapabilities (e.g. intercepting communication or changing form contents)of the detected webinject content. This database can be used to classifywebinject(s) detected on remote systems.

Delivering the detection webpage having detection code can furtherinclude configuring the detection code such that at least one of asource domain, a path, or an HTML structure of the detection webpage isconfigured to trigger an injection of the webinject content by thewebinject. The detection code can include JavaScript or Content SecurityPolicy (CSP). The detection webpage is inserted into an Hypertext MarkupLanguage (HTML) inline frame. The system can further include generatinga classification of the one or more webinjects. Classifying thewebinject based on the one or more fingerprints can further includedetermining an originating software of the webinject based on the one ormore fingerprints. Classifying the webinject based on the one or morefingerprints can further include mapping the one or more fingerprints toa feature set of the webinject. The detection webpage can be deliveredby a traffic generating entity. Delivering a detection webpage to a webbrowser can further include embedding, by the traffic generating entity,the detection webpage into an external webpage. Delivering a detectionwebpage to a web browser can occur upon receiving an indication of auser interaction with the content of a webpage, wherein the webpage isseparate from the detection webpage. The webinject content can includeadded or modified content by the webinject.

BRIEF DESCRIPTION OF THE DRAWINGS

FIG. 1 is a flowchart of an exemplary embodiment of a computerimplemented method for the remote detection of webinjects.

FIGS. 2A-2B are diagrams of exemplary embodiments of systems for theremote detection of webinjects.

FIG. 3 is a block diagram of an example computer system that can be usedin implementing the systems and methods described herein.

DETAILED DESCRIPTION

Disclosed herein are exemplary embodiments of systems and methods forthe remote detection of software, specifically by the detection ofwebinjects in a web browser. The detection and classification ofwebinjects can be particularly useful in researching how to bettersecure and protect computer systems, especially those connected to theInternet. In some instances, the automatic classification of webinjectsenabled by the systems and methods described herein provides significantincreases in processing efficiencies over conventional techniques.Remote detection can be achieved without the use of installed detectionsoftware on a system. For example, instead of installed detectionsoftware, software can be remotely detected on any web browser thatvisits an detection webpage. For the purposes of clarity andconciseness, the methods and systems of FIGS. 1-3 are described togetherherein below.

FIG. 1 is a flowchart of an exemplary embodiment of a computerimplemented method 100 for the remote detection of webinjects. FIGS.2A-2B are diagrams of exemplary embodiments of systems 200 and 201,respectively, for the remote detection of webinjects.

In step 102 of the method 100, one or more detection webpages 204 aredelivered to one or more web browser. In some embodiments, the one ormore detection webpages 204 can be distributed by a third party trafficgenerating entity 206 with access to a high volume of web traffic (e.g.,an advertising network, a website with a large number of daily visitors,etc.) that enables the one or more detection webpages 204 to reach alarge number of browsers 208 across the Internet. In some embodiments,the system 200 may receive an indication of a user interaction with thecontent of a webpage in a browser. For example, the one or moredetection webpages 204 can be delivered to a web browser 208 after auser clicks an advertisement in a webpage. The advertisement in thewebpage can link to the detection webpage(s) 204 and may be acquired forthe purpose of generating traffic to the detection webpage(s) 204. Forinstance, the advertisement can be configured such that, once a userclicks on an advertisement in the webpage, the browser is redirected tothe detection webpage. In some embodiments, the one or more detectionwebpages 204 are not delivered directly as the main page of the webbrowser 208. Instead, the one or more webpages 204 can be embedded by atraffic generating entity 206 into an external main webpage 209 (e.g., athird-party webpage that is not part of the detection system), as one ormore HTML inline frames (also referred to as an “iframe”). Iframesenable the embedding and/or displaying of a first HTML page into asecond HTML page. One advantage of using iframes is that, becauseiframes can be made invisible to the user (and can be sandboxed andisolated from the external main webpage 209), there is minimal to zeroimpact to the user's navigation experience and/or to the operation ofthe traffic generating entity.

While some webinjects are injected into as many webpages 209 as possible(and therefore, into every detection webpage 204 associated with thewebpage), some software (“webinject originator” 216) may only inject itswebinject(s) 212 when a specific website is visited (e.g., onlinebanking websites, social media websites, etc.). To detect this webinject212, one or more source domains, one or more paths, and/or an HTMLstructure of the detection webpage 204 is configured to match thewebinject targets (i.e., the online banking website, etc.). For example,the webinject 212 may only be injected by the originator 216 if thebrowser 208 is visiting the site:

-   -   webinjecttarget.com        Further, the originator 216 may be using a rule (e.g., a regular        expression) to search for the target domain of a webpage while        the browser 208 is loading the webpage. In many instances, this        rule is not sufficiently specific to the term        “webinjecttarget.com”. The uniform resource locator (URL) and/or        content of the detection webpage(s) 204 can be configured such        that, the detection webpage 204 can “bait” or trigger the        webinject 212 to be injected into the detection webpage 204        itself. An example of such an URL of the detection webpage 204        used in an iframe inserted by a traffic generating entity 206 on        an external webpage 209 is:    -   <iframe        src=“https://webinjecttarget.com.detectionsystem.com/webinjecttarget.com/login.aspx”>

In step 104, the detection code 202 is executed during and/or after therendering of the detection webpage 204 in the browser 208, to detect thewebinject 212 content on the detection webpage 204 Document Object Model(DOM). The detection code can inspect the rendering of the detectionwebpage 204 by using JavaScript functions that are triggered on specificwebpage rendering events. The webinject content can include one or moreHypertext Markup Language (HTML) components. The detection code inspectsthe rendering of the detection webpage 204 in the web browser 208. Thisinspection can be done through the use of a JavaScript function thatcompares the content of the detection webpage 204 after the content isrendered with the content that were delivered, through monitoringspecific JavaScript function calls that are commonly used by webinjects212 or through using content security policy (CSP) rules that trigger anaction on any change to the original delivered detection webpage 204.

In step 106, if detection code 202 detects webinject content in thedetection webpage 204, the detection code 202 generates a set offingerprints based on the webinject content. These fingerprints aregenerated using an algorithm that selects one or more webinject blocksof code. and normalizes the one or more blocks of code. An example of awebinject block of code is an inline HTML script tag added to thewebpage. There may be one or more blocks of code belonging to one ormore webinjects. For example, normalization of the blocks includeremoving parts of the blocks that are specific to the browser instance(such as unique identifiers), normalizing character case, etc. Thealgorithm then creates a unique identifier of each block's contents thatcan be smaller than the webinject content itself and that is unique fora particular content. This unique identifier, also referred to asfingerprint in this document, can be calculated using hashing functionsor even simpler cyclic redundancy check (CRC) algorithms that produce aunique number for a given input content.

In step 108, code 202 transmits the fingerprints to one or moreserver(s) 214 where they are stored and/or processed. Additionally, on asmall sample of the executions of the detection code 202, the detectedwebinject content is transmitted to an external server, along with therespective fingerprints. In an exemplary embodiment utilizing CSP, CSPreports are sent by browser 208 to server 214 if a webinject isdetected, the fingerprint is then calculated by a method similar to theone described above but on the server 214 and using the contents of theCSP report.

The one or more fingerprints can be used to classify the webinject 212and/or identify the originator 216 of the webinject 212. Thus, in step110, server 214 classifies the webinject 212 based on the receivedfingerprints. The server 214 can classify the detected webinject into aspecific category, based on a database that maps each of thefingerprints to details about the webinject 212 and/or originator 216.In some embodiments, method 100 can include generating a databaseincluding the fingerprint(s), the name of the origin software, and/or alist of features and/or capabilities of the detected webinject content.For example, features or capabilities can include interceptingcommunication or changing form contents. This database can be builtmanually and/or by automated processing of the webinject content blocksthat are sent, along with the respective fingerprints, to the server 214in step 108. Once these samples are received in server 214, they areused to identify the originator 216 of the webinject, by searching forthe presence of the same blocks of webinject code in the sandboxedexecution of the software and by searching other, open or commerciallyavailable data sources.

Computer-Based Implementations

In some examples, some or all of the processing described above can becarried out on a personal computing device, on one or more centralizedcomputing devices, or via cloud-based processing by one or more servers.In some examples, some types of processing occur on one device and othertypes of processing occur on another device. In some examples, some orall of the data described above can be stored on a personal computingdevice, in data storage hosted on one or more centralized computingdevices, or via cloud-based storage. In some examples, some data arestored in one location and other data are stored in another location. Insome examples, quantum computing can be used. In some examples,functional programming languages can be used. In some examples,electrical memory, such as flash-based memory, can be used.

FIG. 3 is a block diagram of an example computer system 300 that may beused in implementing the technology described in this disclosure.General-purpose computers, network appliances, mobile devices, or otherelectronic systems may also include at least portions of the system 300.The system 300 includes a processor 310, a memory 320, a storage device330, and an input/output device 340. Each of the components 310, 320,330, and 340 may be interconnected, for example, using a system bus 350.The processor 310 is capable of processing instructions for executionwithin the system 300. In some implementations, the processor 310 is asingle-threaded processor. In some implementations, the processor 310 isa multi-threaded processor. The processor 310 is capable of processinginstructions stored in the memory 320 or on the storage device 330.

The memory 320 stores information within the system 300. In someimplementations, the memory 320 is a non-transitory computer-readablemedium. In some implementations, the memory 320 is a volatile memoryunit. In some implementations, the memory 320 is a non-volatile memoryunit.

The storage device 330 is capable of providing mass storage for thesystem 300. In some implementations, the storage device 330 is anon-transitory computer-readable medium. In various differentimplementations, the storage device 330 may include, for example, a harddisk device, an optical disk device, a solid-date drive, a flash drive,or some other large capacity storage device. For example, the storagedevice may store long-term data (e.g., database data, file system data,etc.). The input/output device 340 provides input/output operations forthe system 300. In some implementations, the input/output device 340 mayinclude one or more of a network interface devices, e.g., an Ethernetcard, a serial communication device, e.g., an RS-232 port, and/or awireless interface device, e.g., an 802.11 card, a 3G wireless modem, ora 4G wireless modem. In some implementations, the input/output devicemay include driver devices configured to receive input data and sendoutput data to other input/output devices, e.g., keyboard, printer anddisplay devices 360. In some examples, mobile computing devices, mobilecommunication devices, and other devices may be used.

In some implementations, at least a portion of the approaches describedabove may be realized by instructions that upon execution cause one ormore processing devices to carry out the processes and functionsdescribed above. Such instructions may include, for example, interpretedinstructions such as script instructions, or executable code, or otherinstructions stored in a non-transitory computer readable medium. Thestorage device 330 may be implemented in a distributed way over anetwork, such as a server farm or a set of widely distributed servers,or may be implemented in a single computing device.

Although an example processing system has been described in FIG. 3 ,embodiments of the subject matter, functional operations and processesdescribed in this specification can be implemented in other types ofdigital electronic circuitry, in tangibly-embodied computer software orfirmware, in computer hardware, including the structures disclosed inthis specification and their structural equivalents, or in combinationsof one or more of them. Embodiments of the subject matter described inthis specification can be implemented as one or more computer programs,i.e., one or more modules of computer program instructions encoded on atangible nonvolatile program carrier for execution by, or to control theoperation of, data processing apparatus. Alternatively or in addition,the program instructions can be encoded on an artificially generatedpropagated signal, e.g., a machine-generated electrical, optical, orelectromagnetic signal that is generated to encode information fortransmission to suitable receiver apparatus for execution by a dataprocessing apparatus. The computer storage medium can be amachine-readable storage device, a machine-readable storage substrate, arandom or serial access memory device, or a combination of one or moreof them.

The term “system” may encompass all kinds of apparatus, devices, andmachines for processing data, including by way of example a programmableprocessor, a computer, or multiple processors or computers. A processingsystem may include special purpose logic circuitry, e.g., an FPGA (fieldprogrammable gate array) or an ASIC (application specific integratedcircuit). A processing system may include, in addition to hardware, codethat creates an execution environment for the computer program inquestion, e.g., code that constitutes processor firmware, a protocolstack, a database management system, an operating system, or acombination of one or more of them.

A computer program (which may also be referred to or described as aprogram, software, a software application, a module, a software module,a script, or code) can be written in any form of programming language,including compiled or interpreted languages, or declarative orprocedural languages, and it can be deployed in any form, including as astandalone program or as a module, component, subroutine, or other unitsuitable for use in a computing environment. A computer program may, butneed not, correspond to a file in a file system. A program can be storedin a portion of a file that holds other programs or data (e.g., one ormore scripts stored in a markup language document), in a single filededicated to the program in question, or in multiple coordinated files(e.g., files that store one or more modules, sub programs, or portionsof code). A computer program can be deployed to be executed on onecomputer or on multiple computers that are located at one site ordistributed across multiple sites and interconnected by a communicationnetwork.

The processes and logic flows described in this specification can beperformed by one or more programmable computers executing one or morecomputer programs to perform functions by operating on input data andgenerating output. The processes and logic flows can also be performedby, and apparatus can also be implemented as, special purpose logiccircuitry, e.g., an FPGA (field programmable gate array) or an ASIC(application specific integrated circuit).

Computers suitable for the execution of a computer program can include,by way of example, general or special purpose microprocessors or both,or any other kind of central processing unit. Generally, a centralprocessing unit will receive instructions and data from a read-onlymemory or a random access memory or both. A computer generally includesa central processing unit for performing or executing instructions andone or more memory devices for storing instructions and data. Generally,a computer will also include, or be operatively coupled to receive datafrom or transfer data to, or both, one or more mass storage devices forstoring data, e.g., magnetic, magneto optical disks, or optical disks.However, a computer need not have such devices. Moreover, a computer canbe embedded in another device, e.g., a mobile telephone, a personaldigital assistant (PDA), a mobile audio or video player, a game console,a Global Positioning System (GPS) receiver, or a portable storage device(e.g., a universal serial bus (USB) flash drive), to name just a few.

Computer readable media suitable for storing computer programinstructions and data include all forms of nonvolatile memory, media andmemory devices, including by way of example semiconductor memorydevices, e.g., EPROM, EEPROM, and flash memory devices; magnetic disks,e.g., internal hard disks or removable disks; magneto optical disks; andCD-ROM and DVD-ROM disks. The processor and the memory can besupplemented by, or incorporated in, special purpose logic circuitry.

To provide for interaction with a user, embodiments of the subjectmatter described in this specification can be implemented on a computerhaving a display device, e.g., a CRT (cathode ray tube) or LCD (liquidcrystal display) monitor, for displaying information to the user and akeyboard and a pointing device, e.g., a mouse or a trackball, by whichthe user can provide input to the computer. Other kinds of devices canbe used to provide for interaction with a user as well; for example,feedback provided to the user can be any form of sensory feedback, e.g.,visual feedback, auditory feedback, or tactile feedback; and input fromthe user can be received in any form, including acoustic, speech, ortactile input. In addition, a computer can interact with a user bysending documents to and receiving documents from a device that is usedby the user; for example, by sending web pages to a web browser on auser's user device in response to requests received from the webbrowser.

Embodiments of the subject matter described in this specification can beimplemented in a computing system that includes a back end component,e.g., as a data server, or that includes a middleware component, e.g.,an application server, or that includes a front end component, e.g., aclient computer having a graphical user interface or a Web browserthrough which a user can interact with an implementation of the subjectmatter described in this specification, or any combination of one ormore such back end, middleware, or front end components. The componentsof the system can be interconnected by any form or medium of digitaldata communication, e.g., a communication network. Examples ofcommunication networks include a local area network (“LAN”) and a widearea network (“WAN”), e.g., the Internet.

The computing system can include clients and servers. A client andserver are generally remote from each other and typically interactthrough a communication network. The relationship of client and serverarises by virtue of computer programs running on the respectivecomputers and having a client-server relationship to each other.

While this specification contains many specific implementation details,these should not be construed as limitations on the scope of what may beclaimed, but rather as descriptions of features that may be specific toparticular embodiments. Certain features that are described in thisspecification in the context of separate embodiments can also beimplemented in combination in a single embodiment. Conversely, variousfeatures that are described in the context of a single embodiment canalso be implemented in multiple embodiments separately or in anysuitable sub-combination. Moreover, although features may be describedabove as acting in certain combinations and even initially claimed assuch, one or more features from a claimed combination can in some casesbe excised from the combination, and the claimed combination may bedirected to a sub-combination or variation of a sub-combination.

Similarly, while operations are depicted in the drawings in a particularorder, this should not be understood as requiring that such operationsbe performed in the particular order shown or in sequential order, orthat all illustrated operations be performed, to achieve desirableresults. In certain circumstances, multitasking and parallel processingmay be advantageous. Moreover, the separation of various systemcomponents in the embodiments described above should not be understoodas requiring such separation in all embodiments, and it should beunderstood that the described program components and systems cangenerally be integrated together in a single software product orpackaged into multiple software products.

Particular embodiments of the subject matter have been described. Otherembodiments are within the scope of the following claims. For example,the actions recited in the claims can be performed in a different orderand still achieve desirable results. As one example, the processesdepicted in the accompanying figures do not necessarily require theparticular order shown, or sequential order, to achieve desirableresults. In certain implementations, multitasking and parallelprocessing may be advantageous. Other steps or stages may be provided,or steps or stages may be eliminated, from the described processes.Accordingly, other implementations are within the scope of the followingclaims.

Terminology

The phraseology and terminology used herein is for the purpose ofdescription and should not be regarded as limiting.

The term “approximately”, the phrase “approximately equal to”, and othersimilar phrases, as used in the specification and the claims (e.g., “Xhas a value of approximately Y” or “X is approximately equal to Y”),should be understood to mean that one value (X) is within apredetermined range of another value (Y). The predetermined range may beplus or minus 20%, 10%, 5%, 3%, 1%, 0.1%, or less than 0.1%, unlessotherwise indicated.

The indefinite articles “a” and “an,” as used in the specification andin the claims, unless clearly indicated to the contrary, should beunderstood to mean “at least one.” The phrase “and/or,” as used in thespecification and in the claims, should be understood to mean “either orboth” of the elements so conjoined, i.e., elements that areconjunctively present in some cases and disjunctively present in othercases. Multiple elements listed with “and/or” should be construed in thesame fashion, i.e., “one or more” of the elements so conjoined. Otherelements may optionally be present other than the elements specificallyidentified by the “and/or” clause, whether related or unrelated to thoseelements specifically identified. Thus, as a non-limiting example, areference to “A and/or B”, when used in conjunction with open-endedlanguage such as “comprising” can refer, in one embodiment, to A only(optionally including elements other than B); in another embodiment, toB only (optionally including elements other than A); in yet anotherembodiment, to both A and B (optionally including other elements); etc.

As used in the specification and in the claims, “or” should beunderstood to have the same meaning as “and/or” as defined above. Forexample, when separating items in a list, “or” or “and/or” shall beinterpreted as being inclusive, i.e., the inclusion of at least one, butalso including more than one, of a number or list of elements, and,optionally, additional unlisted items. Only terms clearly indicated tothe contrary, such as “only one of or “exactly one of,” or, when used inthe claims, “consisting of,” will refer to the inclusion of exactly oneelement of a number or list of elements. In general, the term “or” asused shall only be interpreted as indicating exclusive alternatives(i.e. “one or the other but not both”) when preceded by terms ofexclusivity, such as “either,” “one of,” “only one of” or “exactly oneof” “Consisting essentially of,” when used in the claims, shall have itsordinary meaning as used in the field of patent law.

As used in the specification and in the claims, the phrase “at leastone,” in reference to a list of one or more elements, should beunderstood to mean at least one element selected from any one or more ofthe elements in the list of elements, but not necessarily including atleast one of each and every element specifically listed within the listof elements and not excluding any combinations of elements in the listof elements. This definition also allows that elements may optionally bepresent other than the elements specifically identified within the listof elements to which the phrase “at least one” refers, whether relatedor unrelated to those elements specifically identified. Thus, as anon-limiting example, “at least one of A and B” (or, equivalently, “atleast one of A or B,” or, equivalently “at least one of A and/or B”) canrefer, in one embodiment, to at least one, optionally including morethan one, A, with no B present (and optionally including elements otherthan B); in another embodiment, to at least one, optionally includingmore than one, B, with no A present (and optionally including elementsother than A); in yet another embodiment, to at least one, optionallyincluding more than one, A, and at least one, optionally including morethan one, B (and optionally including other elements); etc.

The use of “including,” “comprising,” “having,” “containing,”“involving,” and variations thereof, is meant to encompass the itemslisted thereafter and additional items.

Use of ordinal terms such as “first,” “second,” “third,” etc., in theclaims to modify a claim element does not by itself connote anypriority, precedence, or order of one claim element over another or thetemporal order in which acts of a method are performed. Ordinal termsare used merely as labels to distinguish one claim element having acertain name from another element having a same name (but for use of theordinal term), to distinguish the claim elements.

What is claimed is:
 1. A system for detection of webinjects, the systemcomprising: one or more computer systems programmed to performoperations comprising: delivering a detection webpage to a web browser,the detection webpage comprising detection code configured to detect apresence of a webinject in the detection webpage; inspecting, by thedetection code, rendering of content of the detection webpage in thebrowser to detect webinject content in the detection webpage by thewebinject, the detected webinject content including one or moreHypertext Markup Language (HTML) components; and determining, by anexternal server, an origin software of the webinject based on thedetected webinject content.
 2. The system of claim 1, wherein theoperations further comprise: transmitting to the external server, by thedetection code, the one or more HTML components of the detectedwebinject content, the transmission performed by a portion of executionsof the detection code; identifying the origin software of the one ormore HTML components by at least one of: (i) searching for the one ormore HTML components in sandboxed executions of software or (ii)searching through privately- or publicly-available data sources; andgenerating a database of (a) one or more fingerprints, (b) a name of theorigin software, (c) one or more features of the detected webinjectcontent, and/or (d) one or more capabilities of the detected webinjectcontent.
 3. The system of claim 1, wherein delivering the detectionwebpage comprising detection code further comprises: configuring thedetection code such that at least one of (i) a source domain, (ii) apath, or (iii) an HTML structure of the detection webpage is configuredto trigger an injection of the webinject content by the webinject. 4.The system of claim 1, wherein the detection code comprises JavaScriptor Content Security Policy (CSP).
 5. The system of claim 1, wherein thedetection webpage is inserted into an Hypertext Markup Language (HTML)inline frame.
 6. The system of claim 1, wherein the detection webpage isdelivered by a traffic generating entity.
 7. The system of claim 6,wherein delivering a detection webpage to the web browser furthercomprises: embedding, by the traffic generating entity, the detectionwebpage into an external webpage.
 8. The system of claim 1, whereindelivering a detection webpage to the web browser occurs upon receivingan indication of a user interaction with content of a webpage, thewebpage separate from the detection webpage.
 9. The system of claim 1,wherein the operations further comprise: if webinject content isdetected, generating, by the detection code, a fingerprint for each ofthe one or more HTML components; transmitting, by the detection code,the one or more fingerprints to the external server; and classifying, bythe external server, the webinject based on the one or morefingerprints.
 10. The system of claim 9, wherein classifying, by theexternal server, the webinject based on the one or more fingerprintsfurther comprises: mapping the one or more fingerprints to a feature setof the webinject.